SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: NI , et al . 

(ii) TITLE OP INVENTION: HUMAN TUMOR NECROSIS FACTOR RECEPTOR 
TRIO 

(iii) NUMBER OP SEQUENCES: 14 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: HUMAN GENOME SCIENCES, INC 

(B) STREET: 9410 KEY WEST AVENUE 

(C) CITY: ROCKVILLE 

(D) STATE: MD 

(E) COUNTRY: US 

(F) ZIP: 20850 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patent In Release #l.o, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) qLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME*: BROOKES, ANDERS A. 

(B) REGISTRATION NUMBER: 36,373 

(C) REFERENCE/DOCKET NUMBER: PF379PP 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (301) 309-8504 

(B) TELEFAX: (301) 309-8512 



(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3566 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 109.. 1266 
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(ix) FEATURE: 

(A) NAME/KEY: sig^peptide.' 

(B) LOCATION: 109. .2 71 

(ix) FEATURE: 

(A) NAME/KEY: 'mat jpep tide 

(B) LOCATION: 274.. 1266 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

CGACCCACGC GTCCGCCCAC GCGTCCGGAG AACCTTTGCA CGCGCACAAA CTACGGGGAC 60 

GATTTCTGAT TGATTTTTGG CGCTTTCGAT CCACCCTCCT CCCTTCTC ATG GGA CTT 117 

Met Gly Leu 



■55 



TGG GGA CAA AGC GTC CCG ACC GCC TCG AGC GCT CGA GCA GGG CGC TAT 
Trp Gly Gin Ser Val Pro Thr Ala Ser Ser Ala Arg Ala Gly Arg Tyr 
-50 -45 



165 



CCA GGA GCC AGG ACA GCG TCG GGA ACC AGA CCA TGG CTC CTG GAC CCC 213 
Pro Gly Ala Arg Thr Ala Ser Gly Thr Arg Pro Trp Leu Leu Asp Pro 
-35 -30 -25 • 

AAG ATC CTT AAG TTC GTC GTC TTC ATC GTC GCG GTT CTG CTG CCG GTC 261 
Lys He Leu Lys Phe Val Val Phe He Val Ala Val Leu Leu Pro Val 
-20 -15 -10 ,5 

CGG GTT GAC TCT GCC ACC ATC CCC CGG CAG GAC GAA GTT CCC CAG CAG 309 
Arg Val Asp Ser,Ala Thr He Pro Arg Gin Asp Glu Val Pro Gin Gin 
15 10 

ACA GTG GCC CCA CAG CAA CAG AGG CGC AGC CTC AAG GAG GAG GAG TGT 357 
Thr Val Ala Pro Gin Gin Gin Arg Arg Ser Leu Lys Glu Glu Glu Cys 
15 20 25 

CCA GCA GGA TCT CAT .AGA TCA GAA TAT ACT GGA GCC TGT AAC CCG TGC 4 05 

Pro Ala Gly Ser His Arg Ser Glu Tyr Thr Gly Ala Cys Asn Pro Cys 
30 35 40 

ACA GAG GGT GTG GAT TAC ACC ATT GCT TCC AAC'AAT TTG CCT TCT TGC 4 53 

Thr Glu Gly Val Asp Tyr Thr He Ala Ser Asn Asn Leu Pro Ser Cys 
45 50 55 60 

CTG CTA TGT- ACA GTT TGT AAA TCA GGT CAA ACA AAT AAA AGT TCC TGT 501 
Leu Leu Cys Thr Val Cys Lys Ser Gly Gin Thr Asn ■ Lys Ser Ser Cys 
65 70 75 

ACC ACG ACC AGA GAC ACC GTG TGT CAG TGT GAA AAA GGA AGC TTC CAG 54 9 

Thr Thr Thr Arg Asp Thr Val Cys Gin Cys Glu LyS Gly Ser Phe Gin 
80 85 90 



GAT AAA AAC TCC CCT GAG ATG TGC CGG ACG TGT AGA ACA GGG TGT CCC 
Asp Lys Asn Ser Pro Glu Met Cys Arg Thr Cys Arg Thr Gly Cys Pro 



597 



00 



95 100 105 

AG A GGG ATG GTC AAG GTG AGT AAT TGT ACG CCC-^:GG AGT GAC ATC AAG 645 
Arg Gly Met Val Lys Val Ser Asn Cys Thr Pro Arg Ser Asp lie Lys 
110 115 120 

TGC AAA AAT GAA TCA GCT GCC AGT TCC ACT GGG AAA-"ACC CCA GCA GCG 693 
Cys Lys Asn Glu Ser Ala Ala Ser Ser Thr Gly Lys Thr- Pro Ala Ala 
125 130 135 140 

GAG GAG ACA GTG ACC ACC ATC CTG GGG ATG CTT GCC TCT CCC TAT CAC 741 
Glu Glu Thr Val Thr Thr He Leu Gly Met Leu Ala Ser Pro Tyr His 
145 150 155 

TAC CTT ATC ATC ATA GTG GTT TTA GTC ATC ATT TTA GCT GTG GTT GTG 789 
Tyr Leu He He He Val Val Leu Val He He Leu Ala Val Val Val 
160 165 170 

GTT GGC TTT TCA TGT CGG AAG AAA TTC ATT TCT TAC CTC AAA GGC ATC 837 
Val Gly Phe Ser Cys Arg Lys Lys Phe He Ser Tyr Leu Lys Gly He 
175 180 185 

TGC TCA GGT GGT GGA GGA GGT CCC GAA CGT GTG CAC AGA GTC CTT TTC 88 5 

Cys Ser Gly Gly Gly Gly Gly Pro Glu Arg Val His Arg Val Leu Phe 
190 195 200 

CGG CGG CGT TCA TGT CCT TCA CGA GTT CCT GGG GCG GAG GAC AAT GCC 933 
Arg Arg Arg Ser Cys Pro Ser Arg, Val Pro Gly Ala Glu Asp Asn Ala 
205 210 215 220 

CGC AAC GAG.ACC ^.CTG AGT AAC AGA TAC TTG CAG CCC ACC CAG GTC TCT 981 
Arg Asn Glu Thr Leu Ser Asn Arg Tyr Leu Gin Pro Thr Gin Val Ser 
225 230 235 

GAG CAG GAA ATC CAA GGT CAG GAG CTG GCA GAG CTA ACA GGT GTG ACT 102 9 

Glu Gin Glu He Gin Gly Gin Glu Leu Ala Glu Leu Thr Gly Val Thr 
240 245 250 

GTA GAG TCG CCA GAG GAG CCA CAG CGT CTG CTG GAA CAG GCA GAA GCT 1077 
Val Glu Ser Pro Glu Glu Pro Gin Arg Leu Leu Glu Gin Ala Glu Ala 
255 260 265 

GAA GGG TGT CAG AGG AGG AGG CTG CTG GTT. CCA GTG AAT GAC GCT GAC 112 5 

Glu Gly Cys Gin Arg Arg Arg Leu Leu Val Pro Val Asn Asp Ala Asp 
270 275 280 

TCC GCT GAC ATC AGC ACC TTG CTG GAT GCC TCG GCA ACA CTG GAA GAA 1173 
Ser Ala Asp He Ser Thr Leu Leu Asp Ala Ser Ala Thr Leu Glu Glu 
285 290 295 300 

GGA CAT GCA AAG GAA ACA ATT' CAG GAC CAA CTG GTG GGC TCC GAA AAG 1221 
Gly His Ala Lys Glu Thr He Gin Asp Gin Leu Val Gly Ser Glu Lys 
305 310 315 



CTC TTT TAT GAA GAA GAT GAG GCA GGC TCT GCT ACG TCC TGC CTG 
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Leu Phe Tyr Glu Glu Asp Glu Ala Gly Ser Ala Thr Ser Cys Leu 

320 325 ''-r.. 330 

TGAAAGAATC TCTTCAGGAA ACCAQAGCTT CCCTCATTTA CCTTTTCTCC TACAAAGGGA 1326 

AGCAGCCTGG AAGAAACAGT CCAGTACTTG ACCCATGCCC CAACAAACTC TACTATCCAA 1386 

TATGGGGCAG CTTACCAATG GTCCTAGAAC TTTGTTAACG CACTTGGAGT AATTTTTATG 1446 

AAATACTGCG TGTGATAAGC AAACGGGAGA AATTTATATC AGATTCTTGG CTGCATAGTT 1506 

ATACGATTGT GTATTAAGGG TCGTTTTAGG CCACATGCGG TGGCTCATGC CTGTAATCCC 1566 

AGCACTTTGA TAGGCTGAGG CAGGTGGATT GCTTTGAGCT CGGGAGTTTG AGACCAGCCT 1626 

CATCAACACA GTGAAACTCC ATCTCAATTT AAAAAGAAAA AAAAGTGGTT TTAGGATGTC 1686 

ATTCTTTGCA GTTCTTCATC' ATGAGACAAG TCTTTTTTTC TGCTTCTTAT ATTGCAAGCT 1746 

CCATCTCTAC TGGTGTGTGC ATTTAATGAC ATCTAACTAC AGATGCCGCA CAGCCACAAT 1806 

GCTTTGCCTT ATAGTTTTTT AACTTTAGAA CGGGATTATC TTGTTATTAC CTGTATTTTC 1866 

AGTTTCGGAT ATTTTTGACT TAATGATGAG ATTATCAAGA' CGTAGCCCTA TGCTAAGTCA 1926 

TGAGCATATG GACTTACGAG GGTTCGACTT AGAGTTTTGA GCTTTAAGAT AGGATTATTG 1986 

GGGCTTACCC CCACCTTAAT TAGAGAAACA TTTATATTGC TTACTACTGT AGGCTGTACA 2046 

TCTCTTTTCC GATTTTTGTA TAATGATGTA AACATGGAAA AACTTTAGGA AATGCACTTA 2106 

TTAGGCTGTT TACATGGGTT GCC.TGGATAC AAATCAGCAG TCAAAAATGA CTAAAAATAT 2166 

AACTAGTGAC GGAGGGAGAA ATCCTCCCTC TGTGGGAGGC ACTTACTGCA TTCCAGTTCT 2226 

CCCTCCTGCG CCCTGAGACT GGACCAGGGT TTGATGGCTG GCAGCTTCTC AAGGGGCAGC 2286 

TTGTCTTACT TGTTAATTTT AGAGGTATAT AGCCATATTT ATTTATAAAT AAATATTTAT 234 6 

TTATTTATTT' ATAAGTAGAT GTTTACATAT GCCCAGGATT TTGAAGAGCC TGGTATCTTT 24 06 

GGGAAGCCAT GTGTCTGGTT TGTCGTGCTG GGACAGTCAT GGGACTGCAT CTTCCGACTT 2466 

GTCCACAGCA GATGAGGACA GTGAGAATTA AGTTAGATCC GAGACTGCGA AGAGCTTCTC 2526 

TTTCAAGCGC CATTACAGTT GAACGTTAGT GAATCTTGAG CCTCATTTGG GCTCAGGGCA 2586 

GAGCAGGTGT TTATCTGCCC CGGCATCTGC CATGGCATCA AGAGGGAAGA GTGGACGGTG 2646 . 

CTTGGGAATG GTGTGAAATG GTTGCCGACT CAGGCATGGA TGGGCCCCTC TCGCTTCTGG 2706 

TGGTCTGTGA ACTGAGTCCC TGGGATGCCT TTTAGGGCAG AGATTCCTGA GCTGCGTTTT 2766 

AGGGTACAGA TTCCCTGTTT GAGGAGCTTG GCCCCTCTGT AAGCATCTGA CTCATCTCAG 2826 
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AGATATCAAT TCTTAAACAC TGTGACAACG GGATGTAAAA TGGCTGACAC ATTTGTCCTT 2886 
GTGTCACGTT CCATTATTTT ATTTAAAAAC CTCAGTAATC. QTTTTAGCTT CTTTCCAGCA 2946 
AACTCTTCTC CACAGTAGCC CAGTCGTGGT AGGATAAATT AGGGATATAG TCATTCTAGG 
GGTTTCAGTC TTTTCCATCT CAAGGCATTG TGTGTTTTGT TCCGGGACTG GTTTGGCTGG 
GACAAAGTTA GAACTGCCTG AAGTTCGCAC ATTCAGATTG TTGTGTCCAT GGAGTTTTAG 
GAGGGGATGG CCTTTCCGGT CTTCGCACTT CCATCCTCTC CCCACTTCCC ATCTGGCGTC 
CCACACCTTG TCCCCCTGCA CTTCTGGATG ACCAGGGTGC TGCTGCCTCC TAGTCTTTGC 
CTTTGCTGGG CCTTCTGTGC AGGAGACTTG GTCTCAAAGC TCAGAGAGAG CCAGTCCGGT 

CCCAGCTCCT TTGTCCCTTC CTCAGAGGCC TTCCTTGAAG ATGCATCTAG ACTACCAGCC 
• TTATCAGTGT TTAAGCTTAT TCCTTTAACA TAAGCTTCCT GACAACATGA AATTGTTGGG 

GTTTTTTGGC GTTTGTTGAT TTGTTTAGGT TTTGCTTTAT ACCCGGGCCA AATAGCACAT 

AACACCTGGT TATATATGAA ATACTCATAT GTTTATGACC AAAATAAATA TGAAACCTCA 

AAAAAAAAAA AAAAAAAAAA 



(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 386 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Met Gly Leu Trp Gly Gin Ser Val Pro Thr Ala Ser Ser Ala Arg Ala 

-50 -45 .40 

Gly Arg Tyr Pro Gly Ala Arg Thr Ala Ser Gly Thr Arg Pro Trp Leu 
-35 -30 -25 

Leu Asp Pro Lys He Leu Lys Phe Val Val Phe He Val Ala Val Leu 
-20 -IS .10 

Leu Pro Val Arg Val Asp Ser Ala Thr lie Pro Arg Gin Asp Glu Val 
■5 1,5 

Pro Gin Gin Thr Val Ala Pro Gin Gin Gin Arg Arg Ser Leu Lys Glu 
'° 15 20 25 

Glu Glu Cys Pro Ala Gly Ser His Arg Ser Glu Tyr Thr Gly Ala Cvs 
30 35 40 



3006 
3066 
3126 
3186 
3246 
3306 
3366 
3426 
3436 
3546 
3566 
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Asn Pro Cys Thr Glu Gly Val Asp Tyr Thr He Ala Ser Asn Asn Leu 

45 50 v.. 55 

Pro Ser Cys Leu Leu Cys Thr Val Cys Lys Ser Gly Gin Thr Asn Lys 
SO 65 70 

Ser Ser Cys Thr Thr Thr Arg Asp Thr Val Cys Gin . Cys Glu Lys Gly 



75 



80 



85 



Ser Phe Gin Asp Lys Asn Ser Pro Glu Met Cys Arg Thr Cys Arg Thr 

55 100 105 

Gly Cys Pro Arg Gly Met Val Lys Val Ser Asn Cys Thr Pro Arg Ser 

115 120 

Asp He Lys Cys Lys Asn Glu Ser Ala Ala Ser Ser Thr Gly Lys Thr 

130 135 

Pro Ala Ala Glu Glu Thr Val Thr Thr He Leu Gly Met Leu Ala Ser 

145 150 

Pro Tyr His Tyr Leu lie He lie. Val Val Leu Val He lie Leu Ala 
155 160 165 

Val Val Val Val Gly Phe Ser Cys Arg Lys Lys. Phe lie Ser Tyr Leu 

175 180 185 

Lys Gly He Cys Ser Gly Gly Gly Gly Gly Pro Glu Arg Val His Arg 



190 



195- 



200 



Val Leu Phe Arg Arg Arg Ser Cys Pro Ser Arg Val Pro Gly Ala Glu 
205 210 215 

Asp Asn Ala Arg Asn Glu Thr Leu Ser Asn Arg Tyr Leu Gin Pro Thr 
220 225 230 

Gin Val Ser Glu Gin Glu He Gin Gly Gin Glu Leu Ala Glu Leu Thr 
235 240 . 245 

Gly Val Thr Val Glu Ser Pro Glu Glu Pro Gin Arg Leu Leu Glu Gin 
250 255 260 265 

Ala Glu Ala Glu Gly Cys Gin Arg Arg Arg Leu Leu Val Pro Val Asn 
270 275 • 280 

Asp Ala Asp Ser Ala Asp He Ser Thr Leu Leu Asp Ala Ser Ala Thr 
285 290 295 

Leu Glu Glu Gly His Ala Lys Glu Thr He Gin Asp Gin Leu Val Gly 
300 305 310 

Ser Glu Lys Leu Phe Tyr Glu Glu Asp Glu Ala Gly Ser Ala Thr Ser 
315 320 325 

Cys Leu 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 331 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear ■ 

(ii) MOLECULE .TYPE : protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Leu Gly He Trp Thr Leu Leu Pro Leu Val Leu Thr Ser Val Ala 
1 5 10 15 

Arg Leu Ser Ser Lys Ser Val Asn Ala Gin Val Thr Asp He Asn Ser 
20 25 30 

Lys Gly Leu Glu Leu Arg Lys Thr Val Thr Val Glu Thr Gin Asn Leu 
35 40 45 

Glu Gly Leu His His Asp Gly Gin Phe Cys His Pro Cys Pro Pro Gly 
50 55 60 

Glu Arg Lys Ala Arg Asp Cys Thr Val Asn Gly Asp Glu Pro Asp Cys 
65 70 75 80 

Val Pro Cys Gin Glu Gly Lys Glu Tyr Thr Asp Lys Ala His Phe Ser 

85. 90 

Ser Lys Cys Arg Arg Cys Arg Leu Cys Asp Glu Gly His Gly Leu Glu 
100 105 110 

Val Glu He Asn Cys Thr Arg Thr Gin Asn Thr Lys Cys Arg Cys Lys 
115 120 125 

Pro Asn Phe Phe Cys Asn Ser Thr Val Cys Glu His Cys Asp Pro Cys 
130 135 140 

Thr Lys Cys Glu His Gly He He Lys Glu Cys Thr Leu Thr Ser Asn 
145 150 155 160 

Thr Lys Cys Lys Glu Glu Gly Ser Arg Ser Asn Gly Trp Leu Cys Leu 
165 170 175 

Leu Leu Leu Pro He Pro Leu He Val Trp Val Lys Arg Lys Glu Val 
180 185 190 

Gin Lys Thr Cys Arg Lys His Arg Lys Glu Asn Gin Gly Ser His Glu 
195 200 205 
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Ser Pro Thr Leu Asn Pro Glu Thr Val Ala lie Asn Leu Ser Asp Val 

210 215 .... 220 

'.. ^ 

Asp Leu Ser Lys Tyr lie Thr Thr lie AlaGly Val Met Thr Leu Ser 
225 230 235 240 

Gin Val Lys Gly Phe Val Arg Lys Asn Gly Val Asn Glu Ala Lys lie 
'245 250 255 



Asp Glu lie Lys Asn Asp Asn Val 

. 260 

Gin Leu Leu Arg Asn Trp His Gin 
275 280 



Gin Asp Thr Ala Glu Gin Lys Val 
265 270 

Leu His Gly Lys Lys Glu Ala Tyr 
285 



Asp Thr Leu He Lys Asp Leu Lys Lys Ala Asn Leu Cys Thr Leu Ala 
290 295 300 

Glu Lys He Thr He He Leu Lys Asp He Thr Ser Asp Ser Glu Asn 
305 310 315 320 

Ser Asn Phe Arg Asn Glu He Gin Ser Leu Val 
325 330 



) INFORMATION FOR SEQ ID NO : 4 : 



(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 427 amino acids 
•(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Met Gly Ala Gly Ala Thr Gly Arg Ala Met Asp Gly Pro Arg Leu Leu 
^ 5 . 10 15 

Leu Leu Leu. Leu Leu Gly Val Ser Leu Gly Gly Ala Lys Glu Ala Cys 
20 25 30 

Pro Thr Gly Leu Tyr Thr His Ser Gly Glu Cys Cys Lys Ala Cys Asn 
35 40 45 

Leu Gly Glu Gly Val Ala Gin Pro Cys Gly Ala Asn Gin Thr Val Cys 

^0 55 . . 60 

Glu Pro Cys Leu Asp Ser Val Thr Phe Ser Asp Val Val Ser Ala Thr 
^5 - 70 75 80 

Glu Pro Cys Lys Pro Cys Thr Glu Cys Val Gly Leu Gin Ser Met Ser 
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85 90 , 95 ■ 

Ala Pro Cys Val Glu Ala Asp Asp Ala Val Cys Arg Cys Ala Tyr Gly 
100 105 110 

Tyr Tyr Gin Asp Glu Thr Thr Gly Arg Cys Glu Ala Cys Arg Val Cys 
115 120 125 

Glu Ala Gly Ser Gly Leu Val Phe Ser Cys Gin Asp Lys Gin Asn Thr 
130 ■ . 135 • 140 

Val Cys Glu Glu Cys Pro Asp Gly Thr Tyr Ser Asp Glu Ala Asn His 
145 , 150 155 160 

Val Asp Pro Cys Leu Pro Cys Thr Val Cys Glu Asp Thr Glu Arg Gin 
165 170 175 

Leu Arg Glu Cys Thr Arg Trp Ala Asp Ala Glu Cys Glu Glu lie Pro 
180 185 ' 190 

Gly Arg Trp He Thr Arg Ser Thr Pro Pro Glu Gly Ser Asp Ser Thr 
195 200 205 

Ala Pro Ser Thr Gin Glu Pro Glu Ala Pro Pro Glu Gin Asp Leu He 
210 215 220 

Ala Ser Thr. Val Ala Gly Val Val Thr Thr Val Met Gly Ser Ser Gin 
225 230 23'5 240 

Pro Val ValvThr Arg Gly Thr Thr Asp Asn Leu He Pro Val Tyr Cys 
245 250 255 



Ser He Leu Ala Ala Val Val Val Gly Leu Val Ala Tyr He Ala Phe 
260 -265 270 

Lys Arg Trp Asn Ser Cys Lys Gin Asn Lys Gin Gly Ala Asn Ser Arg 
275 280 285 

Pro Val Asn Gin Thr Pro Pro Pro Glu Gly Glu Lys Leu His Ser Asp 
290 295 300 

Ser Gly He Ser Val- Asp Ser Gin Ser Leu His Asp Gin Gin Pro His 
305 310 315 320 

Thr Gin Thr Ala Ser Gly Gin Ala Leu Lys Gly Asp Gly Gly Leu Tyr 
325 330 335 

Ser Ser Leu Pro Pro Ala Lys Arg Glu Glu Val Glu Lys Leu Leu Asn 
340 345 350 

Gly Ser Ala Gly Asp Thr Trp Arg His Leu Ala Gly Glu Leu Gly Tyr 
355 360 365 



Gin Pro Glu His He Asp Ser Phe Thr His Glu Ala Cys Pro Val Arg 
370 375 380 
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Ala Leu Leu Ala Ser Trp Ala Thr Gin Asp Ser Ala Thr Leu Asp Ala 

385 390 --..395 400 

Leu Leu Ala Ala Leu Arg Arg lie Gin Arg Ala Asp Leu Val Glu. Ser 
405 410 415 

Leu Cys Ser Glu Ser Thr Ala Thr Ser Pro Val 
420 425 

2) INF0R^4ATI0N FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 53 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: .single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Gly Leu Ser Thr Val Pro Asp Leu Leu Leu Pro Leu Val Leu Glu 
^ 5 ' 10 15 

Leu Leu Val , Gly lie Tyr Pro Ser Gly Val He Gly Leu Val Pro His 
20 25 30 

Leu Gly Asp Arg Glu Lys Arg Asp Ser Val Cys Pro Gin Gly Lys Tyr 
35 40 45 

He His Pro Asn Asn Ser He Cys Cys Thr Lys Cys His Lys Gly Thr 
50 55 60 - 

Tyr Leu Tyr Asn Asp Cys Pro Gly Pro Gly Gin Asp Thr Asp Cys Arg 
. 70 75 80 

Glu Cys Glu Ser Gly Ser Phe Thr Ala Ser Glu Asn His Leu Arg His 
85 90 95 

Cys Leu Ser Cys Ser Lys Cys Arg Lys Glu Met Gly Gin Val Glu He 
100 105 110 

Ser Ser Cys Thr Val Asp Arg Asp Thr Val Cys Gly Cys Arg Lys Asn 
115 120 125 

Gin Tyr Arg His Tyr Trp Ser Glu Asn Leu Phe Gin Cys Phe Asn Cys 
130 135 140 

Ser Leu Cys Leu Asn Gly Thr Val His Leu Ser Cys Gin Glu Lys Gin 

150 155 160 

Asn Thr Val Cys Thr Cys His Ala Gly Phe Phe Leu Arg Glu Asn Glu 
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170 



175 



Cys Val Ser Cys Ser Asn Cys Lys Lys S^:^. Leu Glu Cys Thr Lys Leu 
180 185 .190 

Cys Leu Pro Gin lie Glu Asn Val Lys Gly Thr Glu Asp Ser Gly Thr 
• 195 200 205 

Thr Val Leu Leu Pro- Leu Val lie Phe Phe Gly Leu Cys Leu Leu Ser 
210 215 220 

Leu Leu Phe He Gly Leu Met Tyr Arg Tyr Gin Arg Trp Lys Ser Lys 
225 .230 235 240 

Leu Tyr Ser lie Val Cys Gly Lys Ser Thr Pro Glu Lys Glu Gly Glu 
245 250 255 

Leu Glu Gly Thr Thr Thr Lys Pro Leu Ala Pro Asn Pro Ser Phe Ser 
260 265 270 

Pro Thr Pro Gly Phe Thr Pro Thr Leu Gly Phe Ser Pro Val Pro Ser 
275 280 285 

Ser Thr Phe Thr Ser Ser Ser Thr Tyr Thr Pro Gly Asp Cys Pro Asn 
290 295 300 

Phe Ala Ala Pro Arg Arg Glu Val Ala Pro Pro Tyr Gin Gly Ala Asp 
305 310 315 320 

Pro Xle Leu Ala Thr Ala Leu Ala Ser Asp Pro He Pro Asn Pro Leu 
325 330 335 

Gin Lys Trp Glu Asp Ser Ala His Lys Pro Gin Ser Leu Asp Thr Asp 
340 , 345 350 

Asp Pro Ala Thr Leu Tyr Ala Val Val Glu Asn Val Pro Pro Leu Arg 
355 360 365 

Trp Lys Glu Phe Val Arg Arg Leu Gly Leu Ser Asp His Glu He Asp 
370 375 380 

Arg Leu Glu Leu Gin Asn Gly Arg Cys Leu Arg Glu Ala Gin Tyr Ser- 
385 390 395 400 

Met Leu Ala Thr Trp Arg Arg Arg Thr Pro Arg Arg Glu Ala Thr Leu 
405 410 415 

Glu Leu Leu Gly Arg Val Leu Arg Asp Met Asp Leu Leu Gly Cys Leu 
420 425 430 



Glu Asp lie Glu Glu Ala Leu Cys Gly Pro Ala Ala Leu Pro Pro Ala 
'435 440 445 



Pro Ser Leu Leu Arg 
450 
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) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 467 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

'(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ala Pro Pro Pro Ala Arg Val His Leu Gly Ala Phe Leu Ala Val 
^5 10 15 

Thr Pro Asn Pro Gly Ser Ala Ala Ser Gly Thr Glu Ala Ala Ala Ala 
20 25 30 

Thr Pro Ser Lys Val' Trp Gly Ser Ser Ala Gly Arg He Glu Pro Arg 
35 40 45 

Gly Gly Gly Arg Gly Ala Leu Pro Thr Ser Met Gly Gin His Gly Pro 
50 55 60 

Ser Ala Arg Ala Arg Ala Gly Arg Ala Pro Gly Pro Arg Pro Ala Arg 
^5 70 75 80 

Glu Ala Ser .Pro Arg Leu Arg Val His Lys Thr Phe Lys Phe Val Val 
85 90 95 

Val Gly Val Leu Leu Gin Val Val Pro Ser Ser Ala Ala Thr He Lys 
100 105 110 

Leu His Asp Gin Ser He Gly Thr Gin Gin Trp Glu His Ser Pro Leu 
115 120 125 

Gly Glu Leu Cys Pro Pro Gly Ser His Arg Ser Glu Arg Pro Gly Ala 
130 135 140 

Cys Asn Arg Cys Thr Glu Gly Val Gly Tyr Thr Asn Ala Ser Asn Asn 
145 150 155 160 

Leu Phe Ala Cys Leu Pro Cys Thr Ala Cys Lys Ser Asp Glu Glu Glu 
1S5 170 175 

Arg Ser Pro Cys Thr Thr Thr Arg Asn Thr Ala Cys Gin Cys Lys Pro 
180 185 190 

Gly Thr Phe Arg Asn Asp Asn Ser Ala Glu Met Cys Arg Lys Cys Ser 
195 200 205 

Thr Gly Cys Pro Arg Gly Met Val Lys Val Lys Asp Cys Thr Pro Trp 
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210 215 220 

Ser Asp lie Glu Cys Val His Lys Glu Ser -tsly Asn Gly His Asn lie 
225 230 235 240 

Trp Val He Leu Val Val Thr Leu Val Val Pro Leu Leu Leu Val Ala 
245 250 255 

Val Leu He Val Cys Cys Cys He Gly Ser Gly Cys Gly Gly Asp Pro 
260 265 270 

Lys Cys Met Asp Arg Val Cys Phe Trp Arg Leu Gly Leu Leu Arg Gly 
275 280 285 

Pro Gly Ala Glu Asp Asn Ala His Asn Glu He Leu Ser Asn- Ala Asp 
290 295 300 

Ser Leu Ser Thr Phe Val Ser Glu Gin Gin Met Glu Ser Gin Glu Pro 
305 310 315 320 

Ala Asp Leu Thr Gly Val Val Gin Ser Pro Gly Glu Ala Gin Cys Leu 
325 330 335 

Leu Gly Pro Ala Glu Ala Glu Gly Ser Gin Arg Arg Arg Leu Leu Val 
340 345 350 

Pro Ala Asn Gly Ala Asp Pro Thr Glu Thr Leu Met Leu Phe Phe Asp 
355' 360 365 

Lys Phe Ala Asn He Val Pro Phe Asp Ser Trp Asp Gin Leu Met Arg 
370 ' • 375 380 

Gin Leu Asp Leu Thr Lys Asn Glu He Asp Val Val Arg Ala Gly Thr 
385 390 395 400 

Ala Gly Pro Gly Asp Ala Leu Tyr Ala Met Leu Met Lys Trp Val Asn 
405 410 415 

Lys Thr Gly Arg Asn Ala Ser He His Thr Leu Leu Asp Ala Leu Glu 
420 425 430 

Arg Met Glu Glu Arg His Ala Lys Glu Lys. He Gin Asp Leu Leu Val 
435' 440 445 

Asp Ser Gly Lys Phe He Tyr Leu Glu Asp Gly Thr Gly Ser Ala Val 
450 455 460 

Ser Leu Glu 
465 

INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 343 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
GTCACGTTCC ATTATTTTAT TTAAAAACCT CAGTAATCGT TTTAGCTTCT TTCCAGCAAA 
CTCTTCTCCA CAGTAGCCCA GTCGTGGTAG GATAAATTAC GGATATAGTC ATTCTAGGGG 
TTTCAGTCTT TTCCATCTCA AGGCATTGTG TGTTTTGTTC CGGGACTGGT TTGGCTGGGA 
CAAAGTTAGA ACTGCCTGAA GTTCGCACAT TCAGATTGTT GTGTCCATGG AGTTTTAGGA 
GGGGATGGCC TTTCCGGTCT TCGCACTTCC ATCCTCTCCC ACTTCCATCT GGCGTCCACA 
ACTTGTCCCC TGCACTTCTG GATGACACAG GGTGCTGCTG CCT 
(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 279 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

GTGGACGGTG CTTGGGAATG GTGTGAAATG GTTGCCGACT CAGGCATGGA TGGGCCCCTC ' 60 

TCGCTTCTGG TGGTCTGTGA ACTGAGTCCC TGGGATGCCT TTAGGGCAGA GATTCCTGAG 120 

CTGCGTTTTA GGGTACAGAT TCCCTGTTTG AGGAGCTTGG CCCCTCTGTA AGCGTCTGAC 180 

TCATCTCAGA GATATCAATT CTTAAACACT GTGACAACGG GATCTAAAAT GGCTGACACA 24 0 

TTTGTCCTTG TGTCACGTTC CATTATTTTA TTTAAAATT 279 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 250 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



60 
120 
180 
240 
300 
343 
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(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : ^; 

GGCCACGTAG TGCCACGTGC CACAAACTAC GGGGGACGAT TTCTGATTGA ATTTTTGGCG 60 

CTTTCAATCC ACCCTCCTCC CTTCTAATGG GACTTTGGGG ACAAAGGTCC GACCGCCTCG 12 0 

AGCGTCGACA GGGCGCTATC CAGGAGCCAG GACAGCGTCG GGAACCAGAC CATGGCTCCT 180 

GGACCCCAAG ATCCTTAAGT TCGTCGTCTT CATCGTCGGG TTCTCTGCCG. GTAAGTTAGG 24 0 

AGGTCCCTGG 250 
(2) INFORMATION FOR SEQ ID NO: 10: ■ 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:10: 
CGCCCATGGC CACCATCCCC CGGCAG 26 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:11: 

CGCAAGCTTT TAGTAGTGAT AGGGAGAGGC 3 0 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 3 base pairs 
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(B) ' TYPE: nucleic acid 

(C) STRANDEDNESSTsingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

i 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:12: 

CGCGGATCCG CCATCATGGG ACTTTGGGGA CAA 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 
(j1 (A) LENGTH: 30 base pairs 

□ (B) TYPE: nucleic acid 

□ (C) STRANDEDNESS : single 
in* (D)' TOPOLOGY: linear 

Q (ii) MOLECULE TYPE: DNA (genomic) 

U 

n 

.! (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

j CGCGGTACCT TAGTAGTGAT AGGGAGAGGC 

j (2) INFORMATION FOR SEQ ID NO: 14: 

1 

v(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CGCTCTAGAT CAAGCGTAGT CTGGGACGTC GTATGGGTAG TAAGTGATAG GGAGAGGC 
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